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SPECIFICATION 

NOVEL POLYPEPTIDE HAVING WATER CHANNEL ACTIVITY AND DNA 
SEQUENCE 

5 

TECHNICAL FIELD 
The present invention relates to a novel, htiman adipose 
tissue -derived polypeptide having water channel activity and 
to a DNA sequence encoding for the polypeptide. 

10 

BACKGROUND ART 
The permeation of water through a cell membrane generally 
occurs slowly by way of diffusion into the lipid bilayer which 
is the main structure of the cell membrane. Recently, however, 

15 it was discovered that, in certain kinds of cells, water is 
transferred rapidly through the cell membrane, suggesting the 
involvement in the above phenomenon of some membrane protein 
selectively permeable to water. Thereafter, such membrane 
proteins of various kinds have actually been isolated. Such 

20 membrane proteins are designated as water channels. In this 
specification, the function of the above water channels which 
has selective permeation of water through the cell membrane is 
referred to as "water channel activity" . The water channels 
may be permeable to water alone or permeable to not only water 

25 but also low-molecular-weight substances such as glycerol and 
urea. 

As a membrane protein having such water channel activity, 
there have been isolated a group of membrane proteins known as 
aquapolins (AQPs). Furthermore, some aquapolin genes have so 
30 far been cloned, and aquapolins such as AQPl through AQPS, 
FA-CHIP and AQP-TTIP have been discovered in mammals, 
amphibians, plants, etc. [cf. e.g. Akira Sasaki, Igaku no Ayumi 
(Advances in Medicine), vol. 173, No. 9, 1995]. 

P. Agre et al. reported, in Science (vol. 256, pp. 385 
35 to 387, 1992) that Xenopus laevis oocytes in which the in vitro 
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transcript RNA for CHIP28, the current designation of which is 
AQPl , had been introduced showed increased water permeability. 
In Science (vol. 264, pp. 92 to 95, 1994), B. A. van Oost et 
al. disclosed the amino acid sequence of human AQP2 and 
5 suggested that this should be involved in vasopressin -dependent 
urine concentration . 

In Proc. Natl. Acad. Sci. USA (vol- 91, pp. 6269 to 6273, 
1994), Ishibashi et al. disclosed the nucleotide sequence of 
the gene for renal collecting tubule -derived AQP3 and the amino 
10 acid sequence encoded thereof. Ishibashi et al. confirmed its 
water channel activity by injecting the AQP3 cRNA into Xenopus 
laevis oocytes and measuring the water permeability thereof. 

p Ishibashi et al. reported that this AQP3 transported not only 

water but also nonionic small molecules such as urea and 

IS 15 glycerol. 

yj In Proc. Natl. Acad. Sci. USA (vol. 91, pp. 13052 to 13056, 

1994) , J. S. Jung et al. reported about the isolation of AQP4. 
p This AQP4 is known to occur most abundantly in mammalian brains 

^ and have mercury resistance. In J. Biol. Chem. (vol. 270, pp. 

g 20 1908 to 1912, 1995), S. Raina et al. who prepared rat salivary 
O gland- derived AQP5 cDNA describe the nucleotide sequence of the 

r: cDNA and the amino acid sequence encoded thereby. S. Raina et 

y, al. cloned the cDNA by utilizing the occurrence of an NPA 

sequence and confirmed its function by observing that the cRNA 
25 enhances the water permeability of Xenopus laevis oocytes. 

The aquapolin family mentioned above is considered to be 
involved in water metabolism in mammals and, for example, it 
has been confirmed that AQP2 is found only in the renal 
collecting tubule luminal membrane, which is indicative of its 
30 close association with the vasopressin-urea concentration 
system, and its involvement in renal diseases has become 
acknowledged. Therefore, such membrane proteins having water 
channel activity are of importance in any attempt to develop 
novel therapies for water-associated diseases . 
35 Meanwhile , the expression of the aqupolin family 
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mentioned above has been confirmed in such organs as kidney, 
brain, gall bladder, eye, intestine, salivary gland and 
bronchus but there is no report as yet about the occurrence of 
membrane proteins having water channel activity in other organs 
5 or tissues, particularly in adipose tissue. 

SUMMARY OF THE INVENTION 
In view of the above-mentioned state of the art, the 
present invention has for its object to provide a novel membrane 
10 protein having water channel activity and a DNA sequence 
encoding for the polypeptide. 

The present invention is related to a novel polypeptide 
having water channel activity which has the amino acid sequence , 
-£ within the molecule thereof, shown in the sequence listing under 

15 SEQ ID N0:1. 

h The present invention is also related to a nucleotide 

'Al sequence itself which codes for a polypeptide having, within 

S the molecule thereof, the amino acid sequence shown in the 

~ sequence listing under SEQ ID NO:l and having water channel 

Ti_ 20 activity . 

□ The present invention is further related to the DNA 

S sequence shown in the sequence listing under SEQ ID NO: 2. 

C The present invention is still further related to a 

polypeptide having water channel activity which has the amino 
25 acid sequence, within the molecule thereof, encoded by the 

nucleotide No. 173 to No, 1198 of the nucleotide sequence shown 

in the sequence listing under SEQ ID NO: 2. 

DETAILED DESCRIPTION OF THE INVENTION 
30 In the following, the present invention is described in 

detail. 

The polypeptide of the present invention has the amino 
acid sequence shown in the sequence listing under SEQ ID N0:1. 
This polypeptide has a sequence composed of three amino acids , 
35 namely asparagine -proline -alanine, as the amino acid Nos. 195 



to 197. However, the characteristic feature common to the 
so-far known AQPs, that said asparagine -proline -alanine 
sequence occurs twice, is not found in the polypeptide of the 
present invention. That this polypeptide has water channel 
activity can be confirmed from the fact that it enhances the 
water permeability of Xenopus laevis oocytes. 

The above polypeptide may be generated by translation by 
a protein synthesis system constituted, in vivo or in vitro , 
based on the nucleotide sequence coding for the amino acid 
sequence of said polypeptide. The nucleotide sequence of the 
present invention substantially has a region coding for the 
amino acid sequence of said polypeptide and, where necessary, 
may contain one or more other regions such as a promoter region. 
In the protein synthesis based on genetic inf ojrmation, the 
information carried by the gene DNA is transcribed into mRNA 
as the result of DNA-dependent RNA synthesis aided by RNA 
polymerase. And, this mRNA is translated into the amino acid 
sequence in a tRNA- containing protein synthesis system. 
Therefore, the nucleotide sequence of the present invention 
includes not only the DNA sequence but also the RNA sequence. 
Furthermore , since it is generally known that , for an amino acid, 
there is one or a plurality of codons corresponding thereto, 
it is a matter of course that the above-mentioned nucleotide 
sequence is not limited to only one sequence but may include 
nucleotide sequences resulting from substitution of another 
synomyous codon coding for the same amino acid. 

The above polypeptide can be formed based on the genetic 
information carried by the DNA sequence shown in the sequence 
listing under SEQ ID NO: 2 . This polypeptide is encoded by that 
portion of the nucleotide sequence shown in the sequence listing 
under SEQ ID NO: 2 which ranges from the nucleotide No. 173 to 
No. 1198. Of the DNA sequence shown in the sequence listing 
under SEQ ID NO: 2, the nucleotide sequences other than the 
portion of said nucleotide ntirabers are noncoding regions, among 
which the polyadenylation consensus sequence AATAAA occurs at 
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the nucleotide No. 1234 to No. 1239. Other possible reading 
frames of said DNA sequence shown under SEQ ID NO: 2 can be 
excluded from consideration, since the polypeptides encoded are 
very small- sized, hence considered to be incapable of 
5 performing any water channel function. 

It has been confirmed by the inventors that the full- 
length sequence of the above nucleic acid bases has no 
counterpart sequence either in GenBank or in dbEST. 

The polypeptide of the present invention has water 
10 channel activity in adipose tissue. While adipose tissue is 
distributed in various parts of the living organism, the 
polypeptide of the present invention has an action to control 
the transfer of water in such adipose tissue and is expected 
"~ to be effective in upholding normal functions of adipose tissue 

= 15 at various sites. 

2 BEST MODES FOR CARRYING OUT THE INVENTION 

i. The above-mentioned DNA sequence given under SEQ ID NO: 2 

r corresponds to the nucleotide sequence of cDNA obtained from 

^ 20 human adipose tissue by cloning. Human adipose tissue is a 
tissue which stores fat as energy reserves . It is known that 
r: various proteins are formed in this adipose tissue. A 3'- 

directed DNA library is known as a cDNA library from which the 
genes actually expressed in this adipose tissue or, in other 
25 words , the mRNA composition in this adipose tissue can be copied 
faithfully. This 3 '-directed DNA library contains only those 
specified 3 ' - terminal regions of mRNAs which range from poly (A) 
to the Mbo l site which is a restriction enzyme recognition site 
upstream of saidpoly(A) and, therefore, said library is suited 
30 for template preparation by the PGR technique. Therefore, by 
extracting a clone from this library and using it to determine 
a longer nucleotide sequence including the amino acid coding 
region from this complete adipose tissue cDNA library, it 
becomes possible to obtain the genetic information concerning 
35 the protein which is actually formed in adipose tissue. The 
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DNA sequence of the present invention as shown under the 
above-mentioned SEQ ID NO: 2 is found by such cloning. A method 
of obtaining the cDNA by cloning from human adopose tissue is 
now described in detail. 
5 Known as said method is , for example , the method described 

in Biochem. Biophys. Res. Commun., 221 , 286 to 289 (1996). 
According to this method, the total RNA is first separated from 
adipose tissue and, when necessary, purified to give poly (A) 
RNA. For this purification, coiranercially available 
10 purification kits can be used. For example, Pharmacia's Quick 
prep mRNA purification kit or the like in which oligo(dT)- 
cellulose and various buffers are used in combination can 
p judiciously be employed. Then, a double- stranded cDNA is 

synthesized using a pUC19 system vector primer and the 
J! 15 double- stranded cDNA so synthesized is selectively cleaved with 
y3 the restriction enzyme Mbo l (which recognizes the nucleotide 

S sequence GATC) . On that occasion, the GATC sequence on the 

D vector molecule side, which can be methylated to give C^ATC when 

^ replication is effected in dam* bacterial cells , is not cleaved 

yi 20 withMbQi. As the cleaved cDNA is subjected to self -cyclization 
using E. coli ligase, a plasraid containing a cDNA fragment 
extending from poly(A) to the nearest Mbo l site is completed. 
This plasmid is introduced into Escherichia coli , followed by 
cultivation and selection of a transformant E. coli colony. 
25 Then, the cDNA in said colony is amplified by the PGR technique 
using appropriate PGR primers . 

On the other hand, the full-length double -stranded cDNA 
synthesized using the pUC19 system vector primer is cleaved at 
the 5' end using T4 pol3^erase and subjected to cyclization 
30 using T4 ligase and introduction into Escherichia coli for 
transformation. From among the thus-obtained transformant 
colonies, the desired colony is obtained by screening using, 
as a probe, a labeled form of the adipose tissue- specif ic cDNA 
obtained from said 3 '-directed DNA library by the method 
35 mentioned above. , The insert cDNA in this colony is amplified 



by the PGR technique using appropriate PGR primers. The 
amplification product is purified and, after sonication, 
subcloned into the M13 phage. 

The nucleotide sequence of the thus -cloned cDNA can he 
determined, for example, by reaction with a primer dye, 
purification and analysis using an automated sequencer or the 
like. In this manner, the DNA sequence of the present invention 
can be obtained. 

The polypeptide of the present invention has water 
channel activity. This water channel activity can be confirmed 
by observing an enhancement of water permeability in Xenopus 
1 aevis oocytes. It is known that no AQP family gene has been 
expressed in Xfinnpus laevis oocytes and, therefore, any 
increase in water permeability as caused by the injected mRNA 
can be easily confirmed. For this reason, said oocytes are 
widely used in confirming water channel activity . For example , 
in Proc. Natl. Acad. Sci. USA, vol. 91, pp. 6269 to 6273 (1994) , 
Ishibashi et al. confirmed the water channel activity by 
inserting the AQP3 cDNA into the pSP64T-derived BlueScript 
vector, synthesizing the cRNA using T7 RNA polymerase, 
injecting this cRNA into Xenopus laevis oocytes and, after 48 
to 62 hours of incubation following injection, observing an 
increase in water permeability and in the volume of the oocyte . 

The polypeptide encoded by the DNA sequence of the present 
invention can be identified by analyzing the amino acid sequence 
of the polypeptide synthesized in an Escherichia coli protein 
synthesis system constituted in yitro . In this case, the 
methods of identifying N-and C-terminal sequences of expression 
products as described in Shin Seikagaku Jikken Koza 
(Experiments in Biochemistry, A New Course) 1 (published by 
Tokyo Kagaku Dojin) , pages 22 to 24 can be employed. 

The following examples illustrate the present invention 
in further detail. These examples are, however, by no means 
limitative of the scope of the present invention. 
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Example 1 

Dp. termination of DMA nucleotide sequence 

According to a recent report [Biochem. Biophys . Res. 
Commun., 221 . 286 to 289 (1996)], a DNA for an adipose 
tissue-specific collagen-like factor has been cloned using a 
3 '-directed DNA library containing only those specified 3'- 
terminal regions of RNA molecules which range from poly{A) to 
the restriction enzyme Mbo l site upstream thereof [Nature 
Genet., 2, 173 to 179 (1992)], which library makes it possible 
not only to identify the gene under expression but also to 
examine the frequency of expression or amount of expression. 
In accordance with the method described in the above report, 
the DNA nucleotide sequence of the present invention was 
determined using an adipose tissue- specif ic 3 '-directed DNA 
library . 

Method 

Reverse transcriptase was added to poly (A)* RNA separated 
and purified from human adipose tissue using a Quick prep mRNA 
purification kit , followed by insertion into A ZAP II containing 
the pUC19 system vector pBluescript and introduction into 
Escherichia coli . Screening carried out using an adipose 
tissue-specific partial DNA as a probe gave a colony of 
transformant Escherichia coli . Then, together with two 
primers ( SK : 5 ' CGCTCTAGAACTAGTGGATC3 ' ; T7 : 

5 ' GTAATACGACTCACTATAGGGC3 ' ) , PGR [polymerase chain reaction; 
(30 seconds at 95^0 + 30 seconds at 50°C + 60 seconds at 70''C) 
X 15 cycles, followed by (30 seconds at 95°C + 60 seconds at 
70X1) X 15 cycles], was carried out and, after sonication, the 
product was subcloned in M13 . A primer dye was added thereto 
and, after purification, the nucleotide sequence of the DNA was 
analyzed using an automated sequencer. The nucleotide sequence 
obtained is shown in the sequence listing under SEQ ID NO: 2. 



Example 2 

Kyami nation as to water permeability 

As for the studies on the water permeability of the AQP 
family, there are reports about AQPl [Science, 256 , 385-387 
(1992)] and AQP 3 [Proc. Natl. Acad. Scl. USA, £1, 6269-6273 
( 1994 ) ] In which RNA was Introduced into Xenopus laevis oocytes , 
in which AQP family genes have not been expressed, and then the 
water permeability was calculated from changes in the surface 
area and volume of said oocytes in a hypotonic culture medium. 
Therefore, the membrane protein encoded by the DNA of the 
present invention was checked for water permeability according 
to the method described in the references cited above. 

Test method 

The RNA (10 ng) obtained in Example 1 from human adipose 
tissue was introduced into Xenopus laevis oocytes by 
microinjection and the oocytes were incubated at 20°C for 3 days 
in an isotonic culture medium (about 200 mOsm) . The cultured 
oocytes were transferred to a hypotonic culture medium (about 
40 mOsm) . Photograph was taken 20 seconds and 40 seconds after 
transfer, and the sectional area and volume of each oocyte were 
determined using an image analyzer. The water permeability of 
the membrane protein was calculated as follows : 

Permeability (cm/sec) = [ (V40-V20) /20] / [ (A20X 
10'^x4)xl.384] 

where V20 denotes the oocyte volume (cm^) after 20 seconds, V40 
denotes the oocyte volume (cm^) after 40 seconds, and A20 denotes 
the oocyte sectional area (mm^) after 20 seconds. 

The result is shown in Table 1 . The water permeability 
found when purified water was used for microinjection in lieu 
of the RNA is also shown. 

Further, the expression of the polypeptide of the present 
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invention in said oocytes was confirmed by a C-teiminal region 
immunoassay using the rabbit antiserum obtained by using the 
polypeptide of the present invention as synthesized la vitro . 



Table 1 





Water peimeability (cm/ sec) 


Group in which RNA 
introduction was not made 


30.0x10"'' 


Group in which RNA 
introduction was made 


292.5x10'"' 



As is evident from Table 1, the polypeptide encoded by 
the DNA sequence of the present invention caused an increase 
in water permeability after introduction into the oocytes. 
This result indicate that the polypeptide of the present 
invention has water channel activity. 

INDUSTRIAL UTILIZABILITY 
The present invention provides a novel protein having 
water channel activity and a novel DNA sequence encoding the 
protein. Said protein is one found in human adipose tissue for 
which the occurrence of water channels has not been reported 
as yet. The present invention makes it possible to develop 
novel therapies for water or fat metabolism-associated diseases 
in which said tissue is involved. 
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[Sequence listings] 



SEQ ID No . : 1 
Lengtli: 342 
Type: amino acid. 
Topology: linear 
Species: peptide 



Sequence 

Met Val Gin Ala Ser Gly His Arg Arg Ser Thr Arg Gly Ser Lys Met 

5 10 15 

Val Ser Trp Ser Val He Ala Lys He Gin Glu He Leu Gin Arg Lys 

20 25 30 

Met Val Arg Glu Phe Leu Ala Glu Phe Met Ser Thr Tyr Val Met Met 

35 40 45 

Val Phe Gly Leu Gly Ser Val Ala His Met Val Leu Asn Lys Lys Tyr 

50 55 60 

Gly Ser Tyr Leu Gly Val Asn Leu Gly Phe Gly Phe Gly Val Thr Met 
65 70 75 80 

Gly Val His Val Ala Gly Arg He Ser Gly Ala His Met Asn Ala Ala 

85 90 95 

Val Thr Phe Ala Asn Cys Ala Leu Gly Arg Val Pro Trp Arg Lys Phe 

100 105 110 

Pro Val Tyr Val Leu Gly Gin Phe Leu Gly Ser Phe Leu Ala Ala Ala 

115 120 125 

Thr He Tyr Ser Leu Phe Tyr Thr Ala He Leu His Phe Ser Gly Gly 

130 135 140 

Gin Leu Met Val Thr Gly Pro Val Ala Thr Ala Gly He Phe Ala Thr 
145 150 155 160 

Tyr Leu Pro Asp His Met Thr Leu Trp Arg Gly Phe Leu Asn Glu Ala 

165 170 175 

Trp Leu Thr Gly Met Leu Gin Leu Cys Leu Phe Ala Thr Thr Asp Gin 

180 185 190 

Glu Asn Asn Pro Ala Leu Pro Gly Thr Glu Ala Leu Val He Gly He 
195 200 205 
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Leu Val Val lie He 
210 

He Asn Pro Ser Arg 
225 

Gly Trp Gly Lys Gin 
245 

Pro Val Val Ala Pro 
260 

Leu Val Phe He Gly 
275 

Asp Ser Val Ala Tyr 
290 

Gly Ser His Glu Pro 
305 

Pro Ala Asn Arg Ser 
325 

Met Ala Leu Glu His 
340 



Gly Val Ser Leu Gly 
215 

Asp Leu Pro Pro Arg 
230 

Val Phe Ser Asn Gly 
250 

Leu Leu Gly Ala Tyr 
265 

Ser Thr He Pro Arg 
280 

Glu Asp His Gly He 
295 

Thr He Ser Pro Leu 
310 

Ser Val His Pro Ala 
330 

Phe 



Met Asn Thr Gly Tyr Ala 
220 

He Phe Thr Phe He Ala 
235 240 
Glu Asn Trp Trp Trp Val 
255 

Leu Gly Gly He He Tyr 
270 

Glu Pro Leu Lys Leu Glu 
285 

Thr Val Leu Pro Lys Met 
300 

Thr Pro Val Ser Val Ser 
315 320 
Pro Pro Leu His Glu Ser 
335 
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in 20 

m . 



SEQ ID No: 2 
Length: 1258 
Type: nucleotide 
Strandedness : double strand 
5 Topology: linear 

Species: cDNA to mRNA 
Original source 

Organism: human 

Tissue type: adipose tissue 
10 Feature Information 

Name/Key: peptide 

Location: 173. .1198 

Identification method: E 

15 Sequence 

GGCTCTGGAC TGGGGACACA GGGATAGCTG AGCCCCAGCT GGGGGTGGAA GCTGAGCCAG 50 
GGACAGTCAC GGAGGAACAA GATCAAGATG CGCTGTAACT GAGAAGCCCC CAAGGCGGAG 120 
GCTGAGAATC AGAGACATTT CAGCAGACAT CTACAAATCT GAAAGACAAA AC ATG GTT 178 

Met Val 
1 

CAA GCA TOO GGG CAC AGG CGG TOG AGO CGT GGC TCC AAA ATG GTC TCC 226 
Gin Ala Ser Gly His Arg Arg Ser Thr Arg Gly Ser Lys Met Val Ser 

5 10 15 

TGG TCC GTG ATA GCA AAG ATC GAG GAA ATA CTG CAG AGG AAG ATG GTG 274 
25 Trp Ser Val He Ala Lys He Gin Glu He Leu Gin Arg Lys Met Val 
20 25 30 

CGA GAG TTC CTG GCC GAG TTC ATG AGC ACA TAT GTC ATG ATG GTA TTC 322 
Arg Glu Phe Leu Ala Glu Phe Met Ser Thr Tyr Val Met Met Val Phe 
35 40 45 50 

30 GGC CTT GGT TCC GTG GCC CAT ATG GTT CTA AAT AAA AAA TAT GGG AGC 370 
Gly Leu Gly Ser Val Ala His Met Val Leu Asn Lys Lys Tyr Gly Ser 

55 60 55 

TAG CTT GGT GTC AAC TTG GGT TTT GGC TTC GGA GTC ACC ATG GGA GTG 418 
Tyr Leu Gly Val Asn Leu Gly Phe Gly Phe Gly Val Thr Met Gly Val 
35 70 75 80 
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CAC GTG GCA GGC CGC ATC TCT GGA GCC CAC ATG AAC GCA GCT GTG ACC 466 
His Val Ala Gly Arg He Ser Gly Ala His Met Asn Ala Ala Val Thr 

85 90 95 

TTT GCT AAC TGT GCG CTG GGC CGC GTG CCC TGG AGG AAG TTT CCG GTC 514 
5 Phe Ala Asn Cys Ala Leu Gly Arg Val Pro Trp Arg Lys Phe Pro Val 
100 105 110 

TAT GTG CTG GGG CAG TTC CTG GGC TCC TTC CTG GCG GCT GCC ACC ATC 562 
Tyr Val Leu Gly Gin Phe Leu Gly Ser Ptie Leu Ala Ala Ala Thr He 
115 120 125 130 

10 TAG AGT CTC TTC TAC ACG GCC ATT CTC CAC TTT TCG GGT GGA CAG CTG 610 
Tyr Ser Leu Phe Tyr Thr Ala He Leu His Phe Ser Gly Gly Gin Leu 
135 140 145 

f3 ATG GTG ACC GGT CCC GTC GCT ACA GCT GGC ATT TTT GCC ACC TAC CTT 658 

C Met Val Thr Gly Pro Val Ala Thr Ala Gly He Phe Ala Thr Tyr Leu 

2 15 150 155 160 

h CCT GAT CAC ATG ACA TTG TGG CGG GGC TTC CTG AAT GAG GCG TGG CTG 706 

Jf Pro Asp His Met Thr Leu Trp Arg Gly Phe Leu Asn Glu Ala Trp Leu 

5 165 170 175 

= ACC GGG ATG CTC CAG CTG TGT CTC TTC GCC ATC ACG GAC CAG GAG AAC 754 

m 20 Thr Gly Met Leu Gin Leu Cys Leu Phe Ala Thr Thr Asp Gin Glu Asn 
D 180 185 190 

2 AAC CCA GCA CTG CCA GGA ACA GAG GCG CTG GTG ATA GGC ATC CTC GTG 802 

U, Asn Pro Ala Leu Pro Gly Thr Glu Ala Leu Val He Gly He Leu Val 

195 200 205 210 

25 GTC ATC ATC GGG GTG TCC CTT GGC ATG AAC ACA GGA TAT GCC ATC AAC 850 
Val He He Gly Val Ser Leu Gly Met Asn Thr Gly Tyr Ala He Asn 

215 220 225 

CCG TCC CGG GAC CTG CCC CCC CGC ATC TTC ACC TTC ATT GCT GGT TGG 898 
Pro Ser Arg Asp Leu Pro Pro Arg He Phe Thr Phe He Ala Gly Trp 
30 230 235 240 

GGC AAA CAG GTC TTC AGC AAT GGG GAG AAC TGG TGG TGG GTG CCA GTG 946 
Gly Lys Gin Val Phe Ser Asn Gly Glu Asn Trp Trp Trp Val Pro Val 

245 250 255 

GTG GCA CCA CTT CTG GGT GCC TAT CTA GGT GGC ATC ATC TAC CTG GTC 994 
35 Val Ala Pro Leu Leu .Gly Ala Tyr Leu Gly Gly He He Tyr Leu Val 
260 265 270 
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TTC ATT GGC TCC ACC ATC CCA CGG GAG CCC CTG AAA TTG GAG GAT TCT 1042 
Phe lie Gly Ser Thr lie Pro Arg Glu Pro Leu Lys Leu Glu Asp Ser 
275 280 285 290 

GTG GCG TAT GAA GAG CAC GGG ATA ACC GTA TTG CCC AAG ATG GGA TCT 1090 
5 Val Ala Tyr Glu Asp His Gly lie Thr Val Leu Pro Lys Met Gly Ser 
295 300 305 

CAT GAA CCC ACG ATC TCT CCC CTC ACC CCC GTC TCT GTG AGC CCT GCC 1138 
His Glu Pro Thr He Ser Pro Leu Thr Pro Val Ser Val Ser Pro Ala 
310 315 320 

10 AAC AGA TCT TCA GTC CAC CCT GCC CCA CCC TTA CAT GAA TCC ATG GCC 1186 
Asn Arg Ser Ser Val His Pro Ala Pro Pro Leu His Glu Ser Met Ala 

325 330 335 

CTA GAG CAC TTC TAAGCAGAGA TTATTTGTGA TCCCATCCAT TCCCCAATAA 1238 
Leu Glu His Phe 
15 340 

AGCAAGGCTT GTCCGACAAA 1258 



